Henry County
What Is The Political Content in LLMs' Pre- and Post-Training Data?
Ceron, Tanise, Nikolaev, Dmitry, Stammbach, Dominik, Nozza, Debora
Large language models (LLMs) are known to generate politically biased text, yet how such biases arise remains unclear. A crucial step toward answering this question is the analysis of training data, whose political content remains largely underexplored in current LLM research. To address this gap, we present in this paper an analysis of the pre- and post-training corpora of OLMO2, the largest fully open-source model released together with its complete dataset. From these corpora, we draw large random samples, automatically annotate documents for political orientation, and analyze their source domains and content. We then assess how political content in the training data correlates with models' stance on specific policy issues. Our analysis shows that left-leaning documents predominate across datasets, with pre-training corpora containing significantly more politically engaged content than post-training data. We also find that left- and right-leaning documents frame similar topics through distinct values and sources of legitimacy. Finally, the predominant stance in the training data strongly correlates with models' political biases when evaluated on policy issues. These findings underscore the need to integrate political content analysis into future data curation pipelines as well as in-depth documentation of filtering strategies for transparency.
- Media > News (1.00)
- Law > Civil Rights & Constitutional Law (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- (14 more...)
Alum gives IU $60M for artificial intelligence center
By The Associated Press BLOOMINGTON -- An Indiana University alumnus who founded a technology company has donated $60 million to IU for the creation of an artificial intelligence center. ServiceNow founder Fred Luddy made the donation, which is the second-largest private gift IU has ever received. ServiceNow is a cloud computing company based in Santa Clara, California. IU says Luddy's $60 million donation will finance the creation of an artificial intelligence initiative focused on digital health. It will be based in what's now known as the IU Luddy School of Informatics, Computing and Engineering, but will be renamed the Luddy School of Informatics, Computing and Engineering.
- North America > United States > California > Santa Clara County > Santa Clara (0.31)
- North America > United States > Indiana > Henry County > New Castle (0.11)
IU alum gives IU $60M for an artificial intelligence center
An Indiana University alumnus who founded a technology company has donated $60 million to IU for the creation of an artificial intelligence center. ServiceNow founder Fred Luddy made the donation, which is the second-largest private gift IU has ever received. ServiceNow is a cloud computing company based in Santa Clara, California. IU says Luddy's $60 million donation will finance the creation of an artificial intelligence initiative focused on digital health. It will be based in what's now known as the IU Luddy School of Informatics, Computing and Engineering, but will be renamed the Luddy School of Informatics, Computing and Engineering.
- North America > United States > California > Santa Clara County > Santa Clara (0.31)
- North America > United States > Indiana > Monroe County > Bloomington (0.11)
- North America > United States > Indiana > Henry County > New Castle (0.11)